Abstract

In this paper we consider multivariate time series obtained as solutions to multidimensional nonlinear stochastic difference equations, whose coefficients are allowed to be locally degenerate and to present discontinuities. We provide simple and easy-to-check sufficient conditions for the irreducibility, T-chain regularity and geometric ergodicity of these processes.
1 Introduction

Let us consider a system of nonlinear stochastic difference equations

$$X_t = f(X_{t-1}) + g(X_{t-1})\,e_t, \qquad t \ge 1,$$

where $f : \mathbb{R}^n \to \mathbb{R}^n$, $g : \mathbb{R}^n \to \mathbb{R}^{n \times k}$, $\{e_t, t \in \mathbb{N}\}$ is a sequence of independent, identically distributed $k$-dimensional random vectors and $X_0$ is a given random vector. We would like to find simple and easy-to-check conditions ensuring that the solution is an irreducible T-chain and a geometrically ergodic process. The interest of this kind of model is clear: first of all, it can be thought of as the discretization of a multidimensional stochastic differential equation, and the properties of discretized models are usually of great interest. Moreover, from a statistical point of view, it can be considered as a state space representation of a first order multivariate time series model, and many examples in the literature can be represented in this form.

At first sight this problem does not look very original or worth studying in a new paper. However, under some very natural conditions on the coefficients $f$ and $g$, such as lack of continuity and possible local singularity of the matrix valued function $g$, to the best of our knowledge no general results have been published so far, except for those in our former paper [5], in the case $n = k = 1$. It is worth remarking that several papers deal with similar or more general models (see e.g. [1], [2], [9], [10] and [12]), but in all these papers stronger assumptions are required on $g$, such as everywhere continuity and nondegeneracy.
The aim of this paper is to provide a first step towards filling this gap in the literature. We will restrict ourselves to the case $n = k$ and we will assume that the noise random variables $e_t$ possess a strictly positive density on $\mathbb{R}^n$. Under these conditions, we will be able to obtain for this class of models results similar to those that hold in the regular case, adapting some of the standard techniques applied to the smooth version of the present equation. As an application, we shall consider a BEKK-ARCH(1) multivariate model (see Engle and Kroner [3], Hansen and Rahbek [7] and Saikkonen [12]) and obtain a set of sufficient conditions for this process to be geometrically ergodic.

The paper is organized as follows: in Section 2 we will present the model and recall some notation and known results. In Section 3 we will look for sufficient conditions for the solution to be an aperiodic, irreducible T-chain. This part is fairly technical, but these three properties are the fundamental ingredients needed to apply the well known Foster-Lyapunov drift criteria of Section 4. This technique allows us to determine a set of sufficient conditions for the solution to be an ergodic process. In the last section we shall apply the results to a BEKK-ARCH(1) model with a general nonlinear autoregressive term.
By $\lambda_n$ we will denote the Lebesgue measure on $\mathbb{R}^n$. By $A^\circ$ we will denote the interior of the set $A$. For $p \ge 1$, $\|\cdot\|_p$ will denote the $\ell^p$ norm on $\mathbb{R}^n$. For $0 < s < 1$ and $x \in \mathbb{R}^n$, we will define $\|x\|_s = \sum_{i=1}^n |x_i|^s$; this is clearly no longer a norm, but it still defines a pseudometric on $\mathbb{R}^n$, since the triangle inequality holds true. With $|||\cdot|||$ we will denote a generic matrix norm on $\mathbb{R}^{n \times n}$; for $p \ge 1$, $|||\cdot|||_p$ will denote the operator norm associated with $\|\cdot\|_p$, while $|||\cdot|||_{1,p}$ will denote the maximum column sum matrix norm associated with $\|\cdot\|_p$, whose definition is as follows:

$$|||A|||_{1,p} = \max_{1 \le j \le n} \|a_{\cdot j}\|_p .$$

We will use the same definition when $0 < s < 1$, even if again the function $|||\cdot|||_{1,s}$ will not be a norm anymore. Finally, by $|||\cdot|||_F$ we will denote the Frobenius norm

$$|||A|||_F = \Big(\sum_{i,j=1}^n a_{ij}^2\Big)^{1/2}$$

(see [8], Section 5.6 for a complete account on this topic).

For a map $G : X \times Y \to Z$, we shall denote by $G_x$ the $x$-section of $G$, namely $G_x(y) := G(x,y)$, while, given $B_1, \dots, B_t \subseteq \mathbb{R}^n$, we shall denote $B_{1:t} = B_1 \times \cdots \times B_t$ and, similarly, $u_{1:t} = (u_1, u_2, \dots, u_t) \in \mathbb{R}^{nt}$.



2 The multidimensional stochastic difference equation 

In this paper we will study nonlinear stochastic difference equations defined by the system

$$X_t = f(X_{t-1}) + g(X_{t-1})\,e_t, \qquad t \ge 1, \tag{1}$$

where $f : \mathbb{R}^n \to \mathbb{R}^n$, $g : \mathbb{R}^n \to \mathbb{R}^{n \times n}$, $\{e_t, t \in \mathbb{N}\}$ is a sequence of independent, identically distributed $n$-dimensional random vectors and $X_0$ is a given random vector. In the discrete time case there are clearly no problems related to the existence and uniqueness of the solution, while a fundamental question is whether the system is ergodic, which is related to the existence of an invariant distribution. We will be able to prove that, under some assumptions on the coefficients $f$ and $g$ and on the law of the noise sequence, the solution to (1) is geometrically ergodic.
Let us start by considering the regularity of the coefficients $f$ and $g$. In this paper, we consider the case in which neither coefficient is continuous everywhere and the matrix $g(x)$ is locally singular. This last assumption is not unusual: in the scalar case it means that the function $g$ can vanish somewhere, and this is indeed the case if we choose $g$ to be an affine function. Nevertheless, in the literature $g$ is usually assumed to be nonsingular and a possible set of hypotheses (see Liebscher [10]) is that there exist two constants $C_1, C_2 > 0$ such that $|g^{-1}(x)| \le C_1$ and $|\det(g(x))| \le C_2$ on every compact subset of $\mathbb{R}^n$ (see also Saikkonen [12] for the BEKK-ARCH model).

In the present paper, denoting by $\Theta := \{x \in \mathbb{R}^n : \det(g(x)) \neq 0\}$ the set of "regular" points of $g$ and by $C_f$ (resp. $C_g$) the set of the continuity points of the function $f$ (resp. $g$), we will require that the following two assumptions are satisfied:

(H.1) $f$ and $g$ are locally bounded and the sets $\Theta$, $C_f$ and $C_g$ have non-empty interior.

Under smoothness conditions on the coefficients $f$ and $g$, one fruitful approach is to use the concept of forward accessibility from control theory and its equivalence with the much more workable rank condition (see e.g. Meyn and Tweedie [11], Chapter 7). Since in our case we do not assume differentiability of the coefficients, we have to find a different approach, even if our property will in the end be stronger than, but not far from, forward accessibility. Let us denote by $O \subseteq \mathbb{R}^n$ the support of the random vector $e_1$, set $F(x,u) = f(x) + g(x)u$ and, inductively, for $t \in \mathbb{N}$,

$$F^{t+1}(x_0, u_1, \dots, u_{t+1}) := F(F^t(x_0, u_1, \dots, u_t), u_{t+1}). \tag{2}$$

When $t = 0$, $F^0(x_0) = x_0$.
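The recursion (2) can be mirrored directly in code. The following Python sketch is only an illustration under stated assumptions: the helper names (`one_step`, `iterate_F`, `det2`) and the toy coefficients `f_toy`, `g_toy` are hypothetical and are not taken from the paper. The toy $g$ is singular on the line $\{x_1 = 0\}$, and the example shows a control sequence that leaves this singular set in two steps.

```python
from typing import Callable, List, Sequence

Vector = List[float]
Matrix = List[List[float]]


def one_step(f: Callable[[Vector], Vector], g: Callable[[Vector], Matrix],
             x: Vector, u: Sequence[float]) -> Vector:
    """F(x, u) = f(x) + g(x) u, with vectors stored as plain lists."""
    fx, gx = f(x), g(x)
    return [fx[i] + sum(gx[i][j] * u[j] for j in range(len(u)))
            for i in range(len(fx))]


def iterate_F(f, g, x0: Vector, controls: Sequence[Sequence[float]]) -> Vector:
    """F^t(x0, u_1, ..., u_t) via recursion (2); no controls gives F^0(x0) = x0."""
    x = list(x0)
    for u in controls:
        x = one_step(f, g, x, u)
    return x


# Toy coefficients (assumptions for illustration): det g(x) = x[0],
# so g is singular exactly on the line {x[0] = 0}.
def f_toy(x: Vector) -> Vector:
    return [0.5 * x[0], 0.5 * x[1]]


def g_toy(x: Vector) -> Matrix:
    return [[x[0], x[1]], [0.0, 1.0]]


def det2(m: Matrix) -> float:
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


x0 = [0.0, 0.0]  # a singular starting point
x1 = iterate_F(f_toy, g_toy, x0, [[1.0, 1.0]])
x2 = iterate_F(f_toy, g_toy, x0, [[1.0, 1.0], [0.0, 2.0]])
print(x1, det2(g_toy(x1)))  # after one step the state is still singular
print(x2, det2(g_toy(x2)))  # after two steps g is invertible
```

In this toy case no single control moves the origin off the singular line, which is why a condition on $F^t$ for some $t \ge 1$, rather than on $F$ alone, is the natural one to ask for.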



Our second assumption will be that

(H.2) For any $x_0 \notin \Theta$, there exist $t \in \mathbb{N}^+$ and $u_1, \dots, u_t \in O$ such that $F^t(x_0, u_1, \dots, u_t) \in (\Theta \cap C_f \cap C_g)^\circ$ and $F^{t-1}(x_0, u_1, \dots, u_{t-1}) \in C_f \cap C_g$.

Remark 1 Under (H.2) and assuming that $O$ has non-empty interior, we easily obtain that for any $x_0$, there exist $t \in \mathbb{N}^+$ and $u_1, \dots, u_t \in O$ such that $F^{t+1}_{x_0, u_{1:t}}(O)$ has non-empty interior. This condition implies, but is evidently stronger than, forward accessibility, which requires that for any $x_0 \in \mathbb{R}^n$, $\bigcup_{t=0}^{\infty} F^t_{x_0}(O^t)$ has non-empty interior.

Remark 2 If we assume that $f$ and $g$ are continuous, a sufficient condition for assumption (H.2) is that for any $x \in \mathbb{R}^n$, there exists $t \in \mathbb{N}$ such that $f^t(x) \in \Theta^\circ$.

To conclude, let us state the assumptions on the noise sequence $\{e_t, t \in \mathbb{N}\}$. The price to be paid for the weak hypothesis (H.1) is quite high, since we have to assume absolute continuity and lower semicontinuity of the noise density. However, this is often the additional condition that has to be imposed in order to allow some kind of singularity in the coefficient $g$ (see e.g. the results on the bilinear processes in [13] and [3], the threshold bilinear processes in [5] and the nonlinear state space models in [11]).

(H.3) $\{e_t, t \in \mathbb{N}\}$ is a sequence of independent, identically distributed $n$-dimensional random vectors, absolutely continuous w.r.t. the Lebesgue measure $\lambda_n$ on $\mathcal{B}(\mathbb{R}^n)$, with density $p(\cdot)$ strictly positive almost everywhere and lower semicontinuous.

3 Irreducibility, aperiodicity and T-chain property 

In order to apply the classic Foster-Lyapunov drift criteria of the next section (see Meyn and Tweedie [11] for a comprehensive introduction to this topic), we need three basic ingredients. Indeed, we have to prove that the Markov chain, solution to (1), is $\varphi$-irreducible for a given measure $\varphi$, aperiodic and a T-chain. While it is usually not too difficult to find a set of reasonable conditions ensuring that the process is $\varphi$-irreducible and aperiodic, it is more challenging to handle the T-chain condition. Most of the papers in the literature do not spend much time on this part of the study and the authors usually state some general conditions that ensure that the process satisfies these three properties (see e.g. Liebscher [10], Section 4). Since in this paper we would like to allow the diffusion coefficient $g$ to be locally singular, we shall need to impose the previous set of stronger assumptions (H.1)-(H.3).

Let us start by stating a simple result, which we will need in the sequel and whose proof is immediate (see [5]).

Lemma 3 Let $A, B \subseteq \mathbb{R}^n$; if $F : A \to B$ is a continuous function and $G, H : B \to \mathbb{R}$ are two nonnegative lower semicontinuous (lsc) functions, then $G \circ F$ and $G \cdot H$ are lsc.

We are now able to prove the main result of this section:

Proposition 4 Under (H.1)-(H.3), the process solution to (1) is a $\lambda_n$-irreducible, aperiodic T-chain.

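Proof. $\lambda_n$-irreducibility: we have to prove that for any $A \in \mathcal{B}(\mathbb{R}^n)$ such that $\lambda_n(A) > 0$, and any $x \in \mathbb{R}^n$, there exists $t = t(x,A) \in \mathbb{N}$ such that $P^t(x,A) = \mathbb{P}[X_t \in A \mid X_0 = x] > 0$. If $\det(g(x)) \neq 0$, we get $P(x,A) > 0$. Otherwise, by assumption (H.2) there exist $t \in \mathbb{N}$ and $u_1, \dots, u_t \in \mathbb{R}^n$ such that $F^t(x, u_1, \dots, u_t)$, defined in (2), belongs to $(\Theta \cap C_g)^\circ$ and is continuous in $(x, u_1, \dots, u_t)$. Therefore, there exist open balls $B_1, \dots, B_t$ in $\mathbb{R}^n$ such that $F^t(x, B_1, \dots, B_t) \subset \Theta^\circ$. By (H.1)-(H.3) we get

$$
\begin{aligned}
P^{t+1}(x,A) &= \mathbb{P}\big[f(F^t(x,e_{1:t})) + g(F^t(x,e_{1:t}))\,e_{t+1} \in A\big] \\
&\ge \int_{B_{1:t}} \int_A |\det(g(F^t(x,u_{1:t})))|^{-1}\, p\big(g(F^t(x,u_{1:t}))^{-1}(u_{t+1} - f(F^t(x,u_{1:t})))\big)\,du_{t+1}\; p(u_1)\cdots p(u_t)\,du_{1:t} \\
&\ge C_3 \int_A \int_{B_{1:t}} p\big(g(F^t(x,u_{1:t}))^{-1}(u_{t+1} - f(F^t(x,u_{1:t})))\big)\, p(u_1)\cdots p(u_t)\,du_{1:t}\,du_{t+1} > 0 ,
\end{aligned}
$$

where $0 < C_3 := \inf_{u_{1:t} \in B_{1:t}} |\det(g(F^t(x,u_{1:t})))|^{-1} < \infty$, the lower bound following from the local boundedness of $g$, and the $\lambda_n$-irreducibility is proved.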



Aperiodicity: we will prove that the solution process is strongly aperiodic, i.e. that there exist a nontrivial measure $\nu_1$ on $\mathcal{B}(\mathbb{R}^n)$ and a subset $A \in \mathcal{B}(\mathbb{R}^n)$, with $\nu_1(A) > 0$, such that for any $x \in A$ and $B \in \mathcal{B}(\mathbb{R}^n)$, $P(x,B) \ge \nu_1(B)$. Let us take $x \in (\Theta \cap C_f \cap C_g)^\circ$; by assumption (H.1) there exist an open bounded neighborhood $A$ of $x$ and two positive constants $c_1, c_2$ such that

$$0 < c_1 \le |\det(g(y))| \le c_2$$

for any $y \in A$. By (H.3) we get

$$
\begin{aligned}
P(x,B) &= \int_B |\det(g(x))|^{-1}\, p\big(g(x)^{-1}(y - f(x))\big)\,dy \\
&\ge c_2^{-1} \int_{A \cap B} p\big(g(x)^{-1}(y - f(x))\big)\,dy \ge c_2^{-1} k_1 \lambda_n(A \cap B) ,
\end{aligned}
$$

where $0 < k_1 = \inf_{x, y \in A} p\big(g(x)^{-1}(y - f(x))\big)$. The result holds for $\nu_1(\cdot) = c_2^{-1} k_1 \lambda_n(A \cap \cdot)$.
T-chain condition: By Proposition 6.4.2 in Meyn and Tweedie [11], it will be sufficient to prove that for each $x \in \mathbb{R}^n$, there exist $t \in \mathbb{N}$ and a nontrivial substochastic transition kernel $T_x(\cdot,\cdot)$, l.s.c. in the first variable, such that $P^t(y,A) \ge T_x(y,A)$ for each $y \in \mathbb{R}^n$ and $A \in \mathcal{B}(\mathbb{R}^n)$. Let $x \in \mathbb{R}^n$: by assumption (H.2) there exist $t \in \mathbb{N}$ and $u_1, \dots, u_t \in \mathbb{R}^n$ such that $F^t(x, u_1, \dots, u_t) \in (\Theta \cap C_f \cap C_g)^\circ$, and $t+1$ open bounded sets $B_0, B_1, \dots, B_t$ of $\mathbb{R}^n$ such that $F^t(B_0, B_1, \dots, B_t) \subset \Theta^\circ$. Moreover, we can assume that $f$ and $g$ are continuous on $F^t(B_0, B_1, \dots, B_t)$. Hence, for $y \in B_0$ and $A \in \mathcal{B}(\mathbb{R}^n)$, we get

$$
\begin{aligned}
P^{t+1}(y,A) &= \mathbb{P}\big[f(F^t(y,e_{1:t})) + g(F^t(y,e_{1:t}))\,e_{t+1} \in A\big] \\
&\ge \int_{B_{1:t}} \int_A |\det(g(F^t(y,u_{1:t})))|^{-1}\, p\big(g(F^t(y,u_{1:t}))^{-1}(u_{t+1} - f(F^t(y,u_{1:t})))\big)\,du_{t+1}\; p(u_1)\cdots p(u_t)\,du_{1:t} \\
&\ge C_4 \int_A \int_{B_{1:t}} p\big(g(F^t(y,u_{1:t}))^{-1}(u_{t+1} - f(F^t(y,u_{1:t})))\big)\, p(u_1)\cdots p(u_t)\,du_{1:t}\,du_{t+1} =: T(y,A) ,
\end{aligned}
$$

where $0 < C_4 := \inf_{(y,u_{1:t}) \in B_{0:t}} |\det(g(F^t(y,u_{1:t})))|^{-1} < \infty$. By Lemma 3 and Fatou's Lemma we get that $T(y,A)$ (for $y \in B_0$) is a lsc function and we can define the substochastic transition kernel $T_x(y,A) := \phi(y)\,T(y,A)$, with $\phi(\cdot)$ a smooth function whose support is contained in $B_0$. It is clear that $P^{t+1}(y,A) \ge T_x(y,A)$ for each $y \in \mathbb{R}^n$ and $A \in \mathcal{B}(\mathbb{R}^n)$ and the proof is complete. $\square$



We conclude this section by considering a simple bivariate time series, solution of a two-dimensional difference equation in which both a threshold and a singular part are present.

Example 5 Let us take $n = 2$ and consider the difference equation (1), with $f$ and $g$ defined as follows:

$$f(x,y) = \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} + \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}$$

and

$$g(x,y) = \begin{pmatrix} d_{11}x & d_{12}y \\ d_{21}x & d_{22}y \end{pmatrix} 1_{\mathbb{R}^2 \setminus C} + \begin{pmatrix} d_{31}x & 0 \\ d_{32}y & 0 \end{pmatrix} 1_{C} + \begin{pmatrix} d_{41} & \dots \\ d_{42} & \dots \end{pmatrix}$$

where $C = \{x \le 0, y \le 0\}$ and $d_{11}d_{22} - d_{12}d_{21} \neq 0$. Note that the matrix $g(x,y)$ is singular for $(x,y) \in C \cup D_1$ and could be singular for $(x,y) \in D_2$, where $D_1 := \{(x,y) : x = 0, y > 0\}$ and $D_2 := \{(x,y) : x > 0, y = 0\}$. Moreover, $g$ is not continuous on the boundary of $C$. It is easy to prove that if $f$ and $g$ are such that

$$d_{11}d_{42} - d_{21}d_{41} \neq 0, \qquad d_{31}d_{12} - d_{32}d_{11} \neq 0 \tag{3}$$

and the noise sequence admits $\mathbb{R}^2$ as its support, then the hypotheses (H.1)-(H.2) are satisfied. Indeed, the only nontrivial part is (H.2), which is always satisfied since, for $(x,y) \in C$, there exists $u \in \mathbb{R}^2$ such that $f(x,y) + g(x,y)u \notin C \cup D_1 \cup D_2$, while for $(x,y) \in D_i$, $i = 1,2$, there exist $u, v \in \mathbb{R}^2$ such that $f(f(x,y) + g(x,y)u) + g(f(x,y) + g(x,y)u)\,v \notin D_i$.



4 Geometric ergodicity 

In this section we will obtain, in a standard way, a set of sufficient conditions on the coefficients of the difference equation for the solution process to be geometrically ergodic. Due to the weak assumptions on these coefficients, we will obtain a rather strict sufficient condition, but this is in line with previous results in the literature. Since our approach to proving geometric ergodicity is based on the choice of a drift function, for specific models like the BEKK-ARCH(1) model in the next section it could be more convenient to use a different function, but the proof will be similar to the one presented here.

Let us consider the process $\{X_t, t \ge 0\}$ solution to equation (1) and assume that it is a $\lambda_n$-irreducible, aperiodic T-chain. In order to apply the classic Foster-Lyapunov drift criteria for $V(x) = 1 + \|x\|_s$, when $s > 0$, we will need some easy properties of the functions $\|\cdot\|_s$ and $|||\cdot|||_{1,s}$. Given $A$, $B$ two $n \times n$ real matrices and $x \in \mathbb{R}^n$, it holds that

$$|||AB|||_{1,s} \le |||A|||_{1,s}\,|||B|||_{1,s}$$

and that

$$\|Ax\|_s \le |||A|||_{1,s}\,\|x\|_s$$

for any $s > 0$; these results are well known for $s \ge 1$ and immediate to prove for $s < 1$. Note that the same properties hold true for $|||\cdot|||_s$ instead of $|||\cdot|||_{1,s}$, when $s \ge 1$, and furthermore, when $s = 2$, for $|||\cdot|||_F$ instead of $|||\cdot|||_{1,2}$.
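As a numerical sanity check of the bound $\|Ax\|_s \le |||A|||_{1,s}\,\|x\|_s$ in the range $0 < s \le 1$, where $\|x\|_s = \sum_i |x_i|^s$, the following Python sketch evaluates both sides on a small grid. The function names, the matrix `A` and the grid are assumptions chosen for illustration only; the check is restricted to $0 < s \le 1$ and is not a proof.

```python
from itertools import product


def vec_s(x, s):
    """||x||_s = sum_i |x_i|^s, used here only for 0 < s <= 1 (no outer root)."""
    return sum(abs(v) ** s for v in x)


def max_col_s(A, s):
    """|||A|||_{1,s} = max_j ||a_{.j}||_s, the largest column value."""
    n_cols = len(A[0])
    return max(vec_s([row[j] for row in A], s) for j in range(n_cols))


def matvec(A, x):
    """Matrix-vector product for list-of-rows matrices."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]


# Illustrative matrix and exponent (assumptions, not values from the text).
A = [[0.1, -0.15], [-0.15, 0.1]]
s = 0.5
grid = [-2.0, -0.5, 0.0, 0.3, 1.0]

# Smallest value of (right-hand side - left-hand side) over the grid;
# it should be nonnegative up to floating-point rounding.
worst_gap = min(
    max_col_s(A, s) * vec_s(x, s) - vec_s(matvec(A, x), s)
    for x in product(grid, repeat=2)
)
print(worst_gap)
```

Equality is attained when $x$ is a multiple of a coordinate vector selecting the largest column, which is why the smallest gap on the grid is essentially zero.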

We are now ready to provide an easy-to-check set of sufficient conditions for the solution process to be geometrically ergodic.

Proposition 6 Let $\{X_t\}$ be the solution process of (1) and assume that it is a $\lambda_n$-irreducible, aperiodic T-chain. If for $s > 0$

i. $f$ and $g$ are locally bounded;

ii. there exist $a_f > 0$ and $M, a_g, b_f, b_g > 0$ such that

$$\|f(x)\|_s \le a_f + b_f\|x\|_s , \qquad |||g(x)|||_{1,s} \le a_g + b_g\|x\|_s$$

for any $x \in \mathbb{R}^n$ with $\|x\|_s > M$;

iii. $\gamma = b_f + b_g\,E[\|e_1\|_s] < 1$;

then $\{X_t, t \in \mathbb{N}\}$ is geometrically ergodic.

Furthermore, if the previous conditions hold for $s \ge 1$, then each component of the stationary distribution has finite moments up to order $s$.

Proof. Let us consider the function $V(x) = 1 + \|x\|_s$ for an arbitrary $s > 0$ and the compact set $C = \{x \in \mathbb{R}^n : \|x\|_s \le M\}$. Since the solution is a $\lambda_n$-irreducible T-chain, we have that $C$ is also petite.
By the triangle inequality and assumptions i.-iii. we obtain

$$E[V(X_t) \mid X_{t-1} = x] \le \big(b_f + b_g E[\|e_t\|_s]\big)(1 + \|x\|_s) + a_f + a_g E[\|e_t\|_s] + 1 - \big(b_f + b_g E[\|e_t\|_s]\big)$$

for every $x \in \mathbb{R}^n$ such that $\|x\|_s > M$, and

$$E[V(X_t) \mid X_{t-1} = x] \le b_M < \infty \qquad \forall\, x \in C .$$

Summarizing, for any $x \in \mathbb{R}^n$ it holds

$$E[V(X_t) \mid X_{t-1} = x] \le \big(b_f + b_g E[\|e_t\|_s]\big)V(x) + b_M 1_C(x) .$$

If $b_f + b_g E[\|e_t\|_s] < 1$, applying Lemma 15.2.8, Theorem 15.0.1 and Theorem 14.0.1 in Meyn and Tweedie [11], we get that $\{X_t\}$ is a geometrically ergodic Markov chain and, when $s \ge 1$, that the moments of the components of the stationary distribution are finite up to order $s$. $\square$
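The arithmetic behind the drift bound for $\|x\|_s > M$ can be sketched as follows. This Python fragment is a hedged illustration: the function names and all numerical constants are hypothetical, chosen only to show how $\gamma = b_f + b_g E[\|e_1\|_s]$ enters, and that the bound $1 + a_f + b_f\|x\|_s + (a_g + b_g\|x\|_s)E[\|e_1\|_s]$ is the same quantity as $\gamma V(x)$ plus a constant.

```python
def contraction_rate(b_f, b_g, noise_moment):
    """gamma = b_f + b_g * E[||e_1||_s], the quantity in condition iii."""
    return b_f + b_g * noise_moment


def drift_bound(norm_x, a_f, b_f, a_g, b_g, noise_moment):
    """Bound on E[V(X_t) | X_{t-1} = x] for ||x||_s > M, with V(x) = 1 + ||x||_s.

    Uses ||f(x)||_s <= a_f + b_f ||x||_s and |||g(x)|||_{1,s} <= a_g + b_g ||x||_s.
    """
    return 1.0 + a_f + b_f * norm_x + (a_g + b_g * norm_x) * noise_moment


# Hypothetical constants, for illustration only.
a_f, b_f, a_g, b_g, m = 1.0, 0.5, 1.0, 0.2, 1.5
norm_x = 10.0

gamma = contraction_rate(b_f, b_g, m)
lhs = drift_bound(norm_x, a_f, b_f, a_g, b_g, m)
# Same bound rewritten as gamma * V(x) + constant, as in the proof.
rhs = gamma * (1.0 + norm_x) + a_f + a_g * m + 1.0 - gamma
print(gamma < 1.0, lhs, rhs)
```

With these illustrative constants the rate is below one, so the expected value of $V$ contracts outside a sufficiently large sublevel set.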
Remark 7 A simple extension of the previous result can be obtained by taking, for $s \ge 1$, $|||\cdot|||_s$ or $|||\cdot|||_F$ instead of $|||\cdot|||_{1,s}$. More generally, any norm on $\mathbb{R}^n$ (and the corresponding induced matrix norm) could be considered. Indeed, in the proof the only property of the matrix norms that we use is $\|g(x)e_t\|_s \le |||g(x)|||_{1,s}\,\|e_t\|_s$, which holds true for these matrix norms too. Our choice of the matrix norm $|||\cdot|||_{1,s}$ is justified by the possibility of using in assumption iii. absolute moments of order smaller than 1, which weakens the restriction on the noise.

Example 8 Let us consider the threshold model of Example 5. We will assume that the i.i.d. random sequence $\{e_t, t \in \mathbb{N}\}$ is distributed according to an Expol(2) law (see [6]) with density $p(x,y) \propto \exp(-(x^2 - 1)^2 - (y^2 - 1)^2)$. This density has four global maxima and one local minimum and is straightforward to simulate by standard methods. In order to apply the results in Proposition 6, it is easy to check that Assumption ii. holds true, when $s = 1$, for

$$b_f = \max\{|b_{11}| + |b_{21}|,\; |b_{12}| + |b_{22}|\}$$

and

$$b_g = \max\{|d_{11}| + |d_{21}|,\; |d_{31}|,\; |d_{32}|,\; |d_{12}| + |d_{22}|\} .$$

Since $E[\|e_1\|_1] \approx 1.66$, it is easy to see that the previous model with

$$f(x,y) = \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} + \begin{pmatrix} 0.2 & 0.1 \\ 0.1 & 0.3 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}$$

and

$$g(x,y) = \begin{pmatrix} 0.1x & -0.15y \\ -0.15x & 0.1y \end{pmatrix} 1_{\mathbb{R}^2 \setminus C} + \begin{pmatrix} 0.2x & 0 \\ -0.25y & 0 \end{pmatrix} 1_{C} + \begin{pmatrix} 1 & \dots \\ 1 & \dots \end{pmatrix}$$

satisfies the assumptions of Proposition 6 with $b_f + b_g\,E[\|e_1\|_1] \approx 0.981$. On the other hand, if we modify the coefficients for the conditional mean, for example taking $b_{11} = b_{22} = 1$, $b_{12} = b_{21} = 0$, the previous assumptions are no longer satisfied and the distribution of the solution process does not converge to any distribution, as can be seen by simulation. The same result is obtained if we modify the coefficients for the conditional variance, for example taking $d_{11} = d_{12} = d_{21} = d_{22} = 0.4$. Nonetheless, simulation of the limit distribution for other sets of values shows that even if the sufficient condition is (slightly) violated, the model still appears ergodic; as pointed out before, Assumption iii. is strong.
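The moment $E[\|e_1\|_1]$ for the density $p(x,y) \propto \exp(-(x^2-1)^2 - (y^2-1)^2)$ can be checked by one-dimensional quadrature, since the density factorizes and the two coordinates have the same marginal, so that $E[\|e_1\|_1] = 2E|e_{1,1}|$. The Python sketch below uses a plain trapezoid rule; the truncation interval $[-4, 4]$, the number of nodes and the helper names are assumptions made for illustration.

```python
import math


def unnormalised_marginal(x):
    """One-dimensional factor exp(-(x^2 - 1)^2) of the product density."""
    return math.exp(-(x * x - 1.0) ** 2)


def trapezoid(func, a, b, n):
    """Composite trapezoid rule with n subintervals on [a, b]."""
    h = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for i in range(1, n):
        total += func(a + i * h)
    return total * h


# The integrand is negligible outside [-4, 4] (it is exp(-225) at the ends).
mass = trapezoid(unnormalised_marginal, -4.0, 4.0, 8000)
first_abs = trapezoid(lambda x: abs(x) * unnormalised_marginal(x), -4.0, 4.0, 8000)

# The two coordinates are i.i.d., so E||e_1||_1 = 2 * E|coordinate|.
expected_l1_norm = 2.0 * first_abs / mass
print(round(expected_l1_norm, 3))
```

The printed value can be compared with the approximation of $E[\|e_1\|_1]$ quoted in the example.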

5 Multivariate BEKK-ARCH(1) models with nonlinear autoregressive terms

In this final section we will consider a locally degenerate multivariate BEKK-ARCH(1) model with a nonlinear autoregressive term. This model belongs to the multivariate BEKK-GARCH class, first proposed by Engle and Kroner in [3], which is particularly useful for multivariate financial time series, since it allows one to model both the variances and the covariances. In contrast to all previous works, we will only require the matrix valued coefficient to be positive semidefinite, and we will be able to derive simple sufficient conditions for the regularity and geometric ergodicity of the solution process.
Let us consider a process $\{X_t, t \ge 0\}$, solution to the following difference equation:

$$X_t = f(X_{t-1}) + \big(B + (AX_{t-1})(AX_{t-1})^T\big)^{1/2} e_t, \qquad t \ge 1, \tag{4}$$

where $f : \mathbb{R}^n \to \mathbb{R}^n$, $B$ is a positive semidefinite $n \times n$ real matrix, $A$ is an $n \times n$ real matrix and $\{e_t, t \in \mathbb{N}\}$ is a sequence of independent, identically distributed $n$-dimensional random vectors. Since $(Ax)(Ax)^T$ is, for any $x \in \mathbb{R}^n$, a positive semidefinite $n \times n$ real matrix, $B + (Ax)(Ax)^T$ is itself a positive semidefinite matrix and there exists a unique positive semidefinite square root (see [8], Chapter 7). The process $X_t$, solution to (4), is called in the time series literature a BEKK-ARCH(1) model and its ergodicity has been considered only in the regular case, i.e. assuming that the matrix $B + (Ax)(Ax)^T$ is positive definite and its smallest eigenvalue is uniformly bounded away from zero (see Saikkonen [12]).

To determine a set of sufficient conditions for the solution process to be a $\lambda_n$-irreducible, aperiodic T-chain, we will consider for simplicity the case $n = 2$. Let us assume that the matrix $B$ is nonzero and positive semidefinite, but not positive definite. In this case the matrix $(B + (Ax)(Ax)^T)^{1/2}$ is positive semidefinite, but need not be positive definite. Indeed, since $\det(B) = 0$, the determinant of the matrix $B + (Ax)(Ax)^T$ is equal to

$$\det\big(B + (Ax)(Ax)^T\big) = b_{11}(a_{21}x_1 + a_{22}x_2)^2 + b_{22}(a_{11}x_1 + a_{12}x_2)^2 - 2b_{12}(a_{21}x_1 + a_{22}x_2)(a_{11}x_1 + a_{12}x_2)$$

and a simple computation shows that this determinant, and hence that of the square root $g(x)$, is zero for any $(x_1,x_2) \in \mathbb{R}^2$ if both $a_{11}b_{22}^{1/2} - a_{21}b_{11}^{1/2}$ and $a_{12}b_{22}^{1/2} - a_{22}b_{11}^{1/2}$ are zero, while otherwise it is zero just on a straight line $L = \{(x_1,x_2) \in \mathbb{R}^2 : \alpha x_1 = \beta x_2\}$, for suitable constants $\alpha$ and $\beta$. With the notation of Section 2, in the latter case the set of regular points of $g$, $\Theta$, coincides with $L^c$. Moreover, the function $x \mapsto (B + (Ax)(Ax)^T)^{1/2}$ is continuous if $B + (Ax)(Ax)^T$ is positive definite.
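The determinant formula for $B + (Ax)(Ax)^T$ when $\det(B) = 0$ can be checked on a small example. In the following Python sketch the matrices `B` and `A` and the function names are hypothetical choices made for illustration: `B` is the rank-one positive semidefinite matrix $ww^T$ with $w = (1,2)$, and the code compares the direct $2 \times 2$ determinant with the closed-form expression, both at a regular point and at a point on the singular line.

```python
def image(A, x):
    """Return the vector A x for a 2x2 matrix A and x in R^2."""
    return [A[0][0] * x[0] + A[0][1] * x[1], A[1][0] * x[0] + A[1][1] * x[1]]


def det_direct(B, A, x):
    """det(B + (Ax)(Ax)^T) computed from the assembled 2x2 matrix."""
    v = image(A, x)
    M = [[B[i][j] + v[i] * v[j] for j in range(2)] for i in range(2)]
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]


def det_formula(B, A, x):
    """Closed form b11*v2^2 + b22*v1^2 - 2*b12*v1*v2, valid when det(B) = 0."""
    v1, v2 = image(A, x)
    return B[0][0] * v2 ** 2 + B[1][1] * v1 ** 2 - 2 * B[0][1] * v1 * v2


# Illustrative data (assumptions): B = w w^T with w = (1, 2), so det(B) = 0.
B = [[1, 2], [2, 4]]
A = [[1, 1], [0, 1]]

regular_point = [1, 1]      # off the singular line
singular_point = [1, -2]    # on the line 2*x1 = -x2, where the determinant vanishes

print(det_direct(B, A, regular_point), det_formula(B, A, regular_point))
print(det_direct(B, A, singular_point), det_formula(B, A, singular_point))
```

For a rank-one $B = ww^T$ the closed form is the perfect square $(w_1 v_2 - w_2 v_1)^2$ with $v = Ax$, which makes it apparent why the zero set is either the whole plane or a single line through the origin.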
A set of sufficient conditions for the assumptions (H.1) and (H.2) will be as follows:

(B.1) At least one of $a_{11}b_{22}^{1/2} - a_{21}b_{11}^{1/2}$ and $a_{12}b_{22}^{1/2} - a_{22}b_{11}^{1/2}$ is different from zero and for any $x \in L = \{x \in \mathbb{R}^2 : \det(g(x)) = 0\}$, there exist $u, v \in \mathbb{R}^2$ such that $y = f(x) + g(x)u \in C_f \cap L^c$ and $f(y) + g(y)v \in (C_f \cap L^c)^\circ$, with $C_f^\circ \neq \emptyset$.

Remark 9 Clearly, for a specific choice of the autoregressive term $f$, one can provide better sufficient conditions for the assumptions (H.1) and (H.2) to be satisfied.

The next result follows as a simple corollary of Proposition 4.

Proposition 10 Let $n = 2$ and assume that (B.1) and (H.3) are satisfied. Then, the process solution to (4) is a $\lambda_n$-irreducible, aperiodic T-chain.

Let us now consider the geometric ergodicity: in order to apply the results of the previous sections, we will use here the Frobenius matrix norm. In fact we will make use of the fact that $|||A|||_F = \big(\sum_{i,j=1}^n a_{ij}^2\big)^{1/2} = \big(\mathrm{tr}(A^T A)\big)^{1/2}$, which gives in our case that

$$|||(B + (Ax)(Ax)^T)^{1/2}|||_F^2 = \mathrm{tr}(B) + \mathrm{tr}\big((Ax)(Ax)^T\big) .$$

A set of sufficient conditions for the geometric ergodicity of the present model follows as a simple modification of Proposition 6, whose simple proof we omit.
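The trace identity for the Frobenius norm of the square root can be verified numerically in dimension two. The Python sketch below relies on the closed form $M^{1/2} = (M + \sqrt{\det M}\, I)/\sqrt{\mathrm{tr}\, M + 2\sqrt{\det M}}$ for a nonzero $2 \times 2$ positive semidefinite matrix $M$, which follows from the Cayley-Hamilton theorem; the function names and the numerical data are assumptions made for illustration.

```python
import math


def psd_sqrt_2x2(M):
    """Positive semidefinite square root of a nonzero 2x2 PSD matrix M."""
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    s = math.sqrt(max(det, 0.0))  # guard against tiny negative rounding
    t = math.sqrt(M[0][0] + M[1][1] + 2.0 * s)
    return [[(M[0][0] + s) / t, M[0][1] / t],
            [M[1][0] / t, (M[1][1] + s) / t]]


def frobenius_sq(M):
    """Squared Frobenius norm: the sum of the squared entries."""
    return sum(v * v for row in M for v in row)


def matmul_2x2(P, Q):
    """Product of two 2x2 matrices."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


# Illustrative data (assumptions): a singular PSD matrix B and a vector Ax.
B = [[1.0, 2.0], [2.0, 4.0]]
Ax = [1.0, 1.0]
M = [[B[i][j] + Ax[i] * Ax[j] for j in range(2)] for i in range(2)]

R = psd_sqrt_2x2(M)                       # R is the square root, so R R = M
trace_side = B[0][0] + B[1][1] + Ax[0] ** 2 + Ax[1] ** 2
print(frobenius_sq(R), trace_side)
```

Since $\sqrt{\mathrm{tr}(B) + \|Ax\|_2^2} \le \sqrt{\mathrm{tr}(B)} + |||A|||_F\,\|x\|_2$, this identity is what yields a linear growth bound on the norm of the volatility term in terms of $|||A|||_F$.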




Proposition 11 Let $\{X_t\}$ be the solution process of (4) and assume that it is a $\lambda_n$-irreducible, aperiodic T-chain. If

i. $f$ is locally bounded;

ii. there exist $a_f > 0$ and $M, b_f > 0$ such that

$$\|f(x)\|_2 \le a_f + b_f\|x\|_2$$

for any $x \in \mathbb{R}^n$ with $\|x\|_2 > M$;

iii. $\gamma = b_f + |||A|||_F\,E[\|e_1\|_2] < 1$;

then $\{X_t, t \in \mathbb{N}\}$ is geometrically ergodic.

Furthermore, the components of the stationary distribution have finite second moments.

Remark 12 More general conditions for geometric ergodicity are present in the literature on BEKK-ARCH models (see, for instance, [12]), but a basic ingredient of all the proofs is that the matrix $B + (Ax)(Ax)^T$ is positive definite.